RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 
Source: 

Date Processed by STIC: 5-23-01



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is
Y2K-compliant. Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker
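The kind of format check described above can be illustrated with a minimal sketch. It is not the Checker program itself, whose internals are not described here; the function names `residue_tokens` and `check_sequence` and the wording of the findings are hypothetical. The sketch handles nucleotide sequences only: it compares the declared `<211>` LENGTH with the number of residues counted and notes any use of "n".

```python
def residue_tokens(sequence_lines):
    """Yield the alphabetic blocks of each line, skipping line numbers
    and running position totals (which are purely numeric)."""
    for line in sequence_lines:
        for token in line.split():
            if token.isalpha():
                yield token.lower()


def check_sequence(declared_length, sequence_lines):
    """Return a list of findings for one nucleotide sequence.

    Hypothetical illustration only; the finding texts are not the
    messages produced by the real Checker program.
    """
    tokens = list(residue_tokens(sequence_lines))
    counted = sum(len(t) for t in tokens)
    findings = []
    if counted != declared_length:
        findings.append(
            f"length mismatch: declared {declared_length}, counted {counted}"
        )
    if any("n" in t for t in tokens):
        findings.append('"n" used')
    return findings


# SEQ ID NO: 11 (declared length 20) and SEQ ID NO: 10 (declared length 18)
print(check_sequence(20, ["acacagcccc aaagcccgat 20"]))   # []
print(check_sequence(18, ["ttyaaygtng aytggatg 18"]))     # ['"n" used']
```

Both example sequences are taken from the listing in this report; the second one is among those flagged for use of "n".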



3 <110> APPLICANT: University of Wales College of Medicine

5 <120> TITLE OF INVENTION: Protein and DNA coding therefor

7 <130> FILE REFERENCE: PCT/GB99/03654

C--> 9 <140> CURRENT APPLICATION NUMBER: US/09/831,142

C--> 10 <141> CURRENT FILING DATE: 2001-05-07
12 <160> NUMBER OF SEQ ID NOS: 22
14 <170> SOFTWARE: PatentIn Ver. 2.1

16 <210> SEQ ID NO: 1 

17 <211> LENGTH: 870 

18 <212> TYPE: DNA 

19 <213> ORGANISM: Pholas dactylus 

21 <400> SEQUENCE: 1 

22 gaattcggca cgagtcggaa aagaacaaaa tggcttgtat cgttttcgtt gctcttgtcg 60 

23 ctctatgctt aatgcaaccg ggttccggtg aggaagtaca atgcgcgatg aattggacac 120

24 aagctaatga atatgtgttc aacgtggact ggatgaccat tttcatctac gactatggcg 180 

25 ctcaagagca actgtacgaa gatcgggctt tggggctgtg tcggattgaa cgggccggcc 240 

26 caggtaccac aaaagccgtc tggattaact ggagtaacga cacgcagtca tgtgtaacaa 300 

27 gaaaaacaat cttcttcgag gttggtggag aaattgcccg gctagttgac tacagaccac 360 

28 aggaagacgg aactgagaaa acttttacaa gaaaattctc tagcaaaatg ccaggcactt 420 

29 acatgcttat ggacgtgtgc gctacaaggg acgctgatga taaatgcatc gaaggcacaa 480 

30 ttgtggtgac agtcagggtg tccctatatg acgaagataa caatggtgta atggatgaag 540

31 gtaaggtgat tccatctgag acaatcgagg atgatatcaa ggactgtggg ctcttagacc 600 

32 aagatgttga actcgattat acgtggactc aaaacgagtg tgatctacca gacacagtag 660 

33 acgaggctga agacacaccg tcagaaactg gagaattctt ctggtagatc tatcagacta 720 

34 cttttatcag caggacaact ggtcgttacc agacacctat aacgtgtcct catcaataat 780 

35 gtgtaaaaca gaaataatcg atagaatatt gaaaataaaa tgttaataaa cactggttga 840 

36 aatatgaaaa aaaaaaaaaa aaaactcgag 870 

39 <210> SEQ ID NO: 2 

40 <211> LENGTH: 816 

41 <212> TYPE: DNA 

42 <213> ORGANISM: Pholas dactylus 

44 <400> SEQUENCE: 2 

45 gaattcggca cgagggaaaa gaacaaaatg gcttgtatcg ttttcgttgc tcttgtcgct 60 

46 ctatgcttaa tgcaaccggg ttccggtgag gaagtacaat gcgcgatgaa ttggacacaa 120 

47 gctaatgaat atgtgttcaa cgtggactgg atgaccattt tcatctacga ctatggcgct 180 

48 caagagcaac tgtacgagga tcgggctttg gggctgtgtc ggattgaacg ggccggccca 240 

49 ggtaccacaa aagccgtctg gattaactgg agtaacgaca cgcagtcatg tgtaacaaga 300 

50 aaaacaatct tcttcgaggt tggtggagaa attgcccggc tagttgacta cagaccacag 360 

51 gaagacggaa ctgagaaaac ttttacaaga aaattctcta gcaaaatgcc aggcacttac 420 

52 atgcttatgg acgtgtgcgc tacaagggac gctgatgata aatgcatcga aggcacaatt 480 

53 gtggtgacag tcagggtgtc cctatatgac gaagataaca atggtgtaat ggatgaaggt 540

54 aaggttattc catctgagac aatcgaggat gatatcaagg actgtgggct cttagaccaa 600 

55 gatgttgaac tcgattatac gtggactcaa aacgagtgtg atctaccaga cacagtagac 660 

56 gaggctgaag acacaccgtc agaaactgga gaattcttct ggtagatcta tcagaccact 720 

57 tttatcagca ggacaactgg tcgttaccag acacctataa cgtgtcctca tcaataatgt 780 

58 gtaaaacaga aataatcgat agaatattga aaataa 816 
61 <210> SEQ ID NO: 3 
62 <211> LENGTH: 852

63 <212> TYPE: DNA 

64 <213> ORGANISM: Pholas dactylus 

66 <400> SEQUENCE: 3 

67 gtcggaaaag aacaaaatgg cttgtatcgt tttcgttgct cttgtcgctc tatgcttaat 60 

68 gcaaccgggt tccggtgagg aagtacaatg cgcgatgaat tggacacaag ctaatgaata 120 

69 tgtgttcaac gtggactgga tgaccatttt catctacgac tatggcgctc aagagcaact 180 

70 gtacgaggat cgggctttgg ggctgtgtcg gattgaacgg gccggcccag gtaccacaaa 240 

71 agccgtctgg attaactgga gtaacgacac gcagtcatgt gtaacaagaa aaacaatctt 300 

72 cttcgaggtt ggtggagaaa ttgcccggct agttgactac agaccacagg aagacggaac 360 

73 tgagaaaact tttacaagaa aattctctag caaaatgcca ggcacttaca tgcttatgga 420 

74 cgtgtgcgct acaagggacg ctgatgataa atgcatcgaa ggcacaattg tggtgacagt 480

75 cagggtgtcc ctatatgacg aagataacaa tggtgtaatg gatgaaggta aggttattcc 540 

76 atctgagaca atcgaggatg atatcaagga ctgtgggctc ttagaccaag atgttgaact 600

77 cgattatacg tggactcaaa acgagtgtga tctaccagac acagtagacg aggctgaaga 660 

78 cacaccgtca gaaactggag aattcttctg gtagatctat cagaccactt ttatcagcag 720 

79 gacaactggt cgttaccaga cacctataac gtgtcctcat caataatgtg taaaacagaa 780 

80 ataatcgata gaatattgaa aataaaatgt taatagacac tggttgaaaa aaaaaaaaaa 840 

81 aaaaaactcg ag 852 

84 <210> SEQ ID NO: 4 

85 <211> LENGTH: 225 

86 <212> TYPE: PRT

87 <213> ORGANISM: Pholas dactylus
89 <400> SEQUENCE: 4

90 Met Ala Cys Ile Val Phe Val Ala Leu Val Ala Leu Cys Leu Met Gln

91 1 5 10 15

93 Pro Gly Ser Gly Glu Glu Val Gln Cys Ala Met Asn Trp Thr Gln Ala

94 20 25 30

96 Asn Glu Tyr Val Phe Asn Val Asp Trp Met Thr Ile Phe Ile Tyr Asp

97 35 40 45
99 Tyr Gly Ala Gln Glu Gln Leu Tyr Glu Asp Arg Ala Leu Gly Leu Cys
200 Arg Ile Glu Arg Ala Gly Pro Gly Thr Thr Lys Ala Val Trp Ile Asn

201 65 70 75 80

203 Trp Ser Asn Asp Thr Gln Ser Cys Val Thr Arg Lys Thr Ile Phe Phe

204 85 90 95

206 Glu Val Gly Gly Glu Ile Ala Arg Leu Val Asp Tyr Arg Pro Gln Glu

207 100 105 110 

209 Asp Gly Thr Glu Lys Thr Phe Thr Arg Lys Phe Ser Ser Lys Met Pro 

210 115 120 125 

212 Gly Thr Tyr Met Leu Met Asp Val Cys Ala Thr Arg Asp Ala Asp Asp 

213 130 135 140 

215 Lys Cys Ile Glu Gly Thr Ile Val Val Thr Val Arg Val Ser Leu Tyr

216 145 150 155 160

218 Asp Glu Asp Asn Asn Gly Val Met Asp Glu Gly Lys Val Ile Pro Ser

219 165 170 175

221 Glu Thr Ile Glu Asp Asp Ile Lys Asp Cys Gly Leu Leu Asp Gln Asp

222 180 185 190

224 Val Glu Leu Asp Tyr Thr Trp Thr Gln Asn Glu Cys Asp Leu Pro Asp

225 195 200 205 

227 Thr Val Asp Glu Ala Glu Asp Thr Pro Ser Glu Thr Gly Glu Phe Phe 

228 210 215 220 

230 Trp 

231 225 

234 <210> SEQ ID NO: 7 

235 <211> LENGTH: 17 

236 <212> TYPE: DNA 

237 <213> ORGANISM: Pholas dactylus 

239 <220> FEATURE: 

240 <221> NAME/KEY: modified_base

241 <222> LOCATION: (3)

242 <223> OTHER INFORMATION: i
244 <400> SEQUENCE: 7

W--> 245 acnathttyt tycargt 17

248 <210> SEQ ID NO: 8 

249 <211> LENGTH: 17 

250 <212> TYPE: DNA 

251 <213> ORGANISM: Pholas dactylus 

253 <220> FEATURE: 

254 <221> NAME/KEY: modified_base

255 <222> LOCATION: (15)

256 <223> OTHER INFORMATION:

258 <400> SEQUENCE: 8

W--> 259 cargargarg ^(n^cnga 17

262 <210> SEQ ID NO: 9 

263 <211> LENGTH: 17 

264 <212> TYPE: DNA 

265 <213> ORGANISM: Pholas dactylus 

267 <220> FEATURE: 

268 <221> NAME/KEY: modified_base

269 <222> LOCATION: (3)
270 <223> OTHER INFORMATION: i

272 <400> SEQUENCE: 9

W--> 273 tcng^gjccyt cytcytg 17

276 <210> SEQ ID NO: 10 

277 <211> LENGTH: 18 

278 <212> TYPE: DNA 

279 <213> ORGANISM: Pholas dactylus 

281 <220> FEATURE: 

282 <221> NAME/KEY: modified_base

283 <222> LOCATION: (9)

284 <223> OTHER INFORMATION: i
286 <400> SEQUENCE: 10

W--> 287 ttyaaygtng aytggatg 18

290 <210> SEQ ID NO: 11 

291 <211> LENGTH: 20 

292 <212> TYPE: DNA 

293 <213> ORGANISM: Pholas dactylus 

295 <400> SEQUENCE: 11 

296 acacagcccc aaagcccgat 20 

299 <210> SEQ ID NO: 12 

300 <211> LENGTH: 20 

301 <212> TYPE: DNA 

302 <213> ORGANISM: Pholas dactylus 

304 <400> SEQUENCE: 12 

305 ttgcccggct agttgactac 20 

308 <210> SEQ ID NO: 13 

309 <211> LENGTH: 24 

310 <212> TYPE: DNA 

311 <213> ORGANISM: Pholas dactylus 

313 <400> SEQUENCE: 13 

314 catatttcaa ccagtgttta ttaa 24 

317 <210> SEQ ID NO: 14 

318 <211> LENGTH: 19 

319 <212> TYPE: DNA 

320 <213> ORGANISM: Pholas dactylus 

322 <400> SEQUENCE: 14 

323 caattgtgcc ttcgatgca 19 

326 <210> SEQ ID NO: 15 

327 <211> LENGTH: 17 

328 <212> TYPE: DNA 

329 <213> ORGANISM: Pholas dactylus 

331 <400> SEQUENCE: 15 

332 ggactgtggg ctcttag 17 

335 <210> SEQ ID NO: 16 

336 <211> LENGTH: 20 

337 <212> TYPE: DNA 

338 <213> ORGANISM: Pholas dactylus 

340 <400> SEQUENCE: 16 

341 atggcttgta tcgttttcgt 20 
L:9 M:270 C: Current Application Number differs, Replaced Current Application Number

L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:245 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7 

L:259 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8 

L:273 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:9 

L:287 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:10 
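Each summary line above follows the same pattern: an `L:` field giving the line number in the printed listing, an `M:` field giving a message number, a one-letter code matching the marker printed beside the listing line (`C` or `W` in this report), and the message text. The sketch below shows one way such lines could be parsed. The `Finding` type and `parse_finding` function are hypothetical names, and reading `W` as a warning and `C` as a correction is an assumption, not something this report states.

```python
import re
from typing import NamedTuple, Optional


class Finding(NamedTuple):
    line: int        # "L:" field, line number in the printed listing
    message_id: int  # "M:" field
    kind: str        # one-letter code as printed, e.g. "C" or "W"
    text: str        # remainder of the line


# Pattern inferred from the summary lines in this report only.
PATTERN = re.compile(r"^L:(\d+)\s+M:(\d+)\s+([A-Z]):\s*(.*)$")


def parse_finding(raw: str) -> Optional[Finding]:
    """Parse one summary line; return None if it does not match."""
    match = PATTERN.match(raw.strip())
    if match is None:
        return None
    line, message_id, kind, text = match.groups()
    return Finding(int(line), int(message_id), kind, text.strip())


finding = parse_finding('L:245 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7')
print(finding.line, finding.message_id, finding.kind)  # 245 341 W
```

Lines that do not fit the pattern, such as the date printed at the foot of the page, yield `None` rather than raising an error.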



file://C:\Crf3\Outhold\VsrI831142.htm



5/23/01 



